421 research outputs found

    Identifying Web Tables - Supporting a Neglected Type of Content on the Web

    Full text link
    The abundance of the data in the Internet facilitates the improvement of extraction and processing tools. The trend in the open data publishing encourages the adoption of structured formats like CSV and RDF. However, there is still a plethora of unstructured data on the Web which we assume contain semantics. For this reason, we propose an approach to derive semantics from web tables which are still the most popular publishing tool on the Web. The paper also discusses methods and services of unstructured data extraction and processing as well as machine learning techniques to enhance such a workflow. The eventual result is a framework to process, publish and visualize linked open data. The software enables tables extraction from various open data sources in the HTML format and an automatic export to the RDF format making the data linked. The paper also gives the evaluation of machine learning techniques in conjunction with string similarity functions to be applied in a tables recognition task.Comment: 9 pages, 4 figure

    A nonmitochondrial hydrogen production in Naegleria gruberi

    Get PDF
    Naegleria gruberi is a free-living heterotrophic aerobic amoeba well known for its ability to transform from an amoeba to a flagellate form. The genome of N. gruberi has been recently published, and in silico predictions demonstrated that Naegleria has the capacity for both aerobic respiration and anaerobic biochemistry to produce molecular hydrogen in its mitochondria. This finding was considered to have fundamental implications on the evolution of mitochondrial metabolism and of the last eukaryotic common ancestor. However, no actual experimental data have been shown to support this hypothesis. For this reason, we have decided to investigate the anaerobic metabolism of the mitochondrion of N. gruberi. Using in vivo biochemical assays, we have demonstrated that N. gruberi has indeed a functional [FeFe]-hydrogenase, an enzyme that is attributed to anaerobic organisms. Surprisingly, in contrast to the published predictions, we have demonstrated that hydrogenase is localized exclusively in the cytosol, while no hydrogenase activity was associated with mitochondria of the organism. In addition, cytosolic localization displayed for HydE, a marker component of hydrogenase maturases. Naegleria gruberi, an obligate aerobic organism and one of the earliest eukaryotes, is producing hydrogen, a function that raises questions on the purpose of this pathway for the lifestyle of the organism and potentially on the evolution of eukaryotes

    Investigations on the Peach 4 Debrite, a Late Pleistocene Mass Movement on the Northwest British Continental Margin

    Get PDF
    The Peach 4 debrite is the most recent in a series of large scale Pleistocene MTDs within the Barra fan on the northwest British continental margin. Geophysical data indicate that Peach 4 was formed through a combination of blocky and muddy debris flows and affects an area of ~ 700 km2. BGS core sample 56 -10 36, located directly over the Peach 4 debrite, provides a minimum age of 14.68 ka cal BP for the last major failure. An upwards fining turbidite sequence in BGS core sample 56 -10 239 is associ-ated with increased As and S concentrations, indicators of diagenetic pyrite which forms under anoxic conditions. It is proposed that As and S concentrations may pro-vide a method of distinguishing between contourite and turbidite sedimentation, though further research is required

    A Port Graph Rewriting Approach to Relational Database Modelling

    Get PDF
    International audienceWe present new algorithms to compute the Syntactic Closure and the Minimal Cover of a set of functional dependencies, using strategic port graph rewriting. We specify a Visual Domain Specific Language to model relational database schemata as port graphs, and provide an extension to port graph rewriting rules. Using these rules we implement strategies to compute a syntactic closure, analyse it and find minimal covers, essential for schema normalisation. The graph program provides a visual description of the computation steps coupled with analysis features not available in other approaches. We prove soundness and completeness of the computed closure. This methodology is implemented in PORGY

    Informational Gene Phylogenies Do Not Support a Fourth Domain of Life for Nucleocytoplasmic Large DNA Viruses

    Get PDF
    Mimivirus is a nucleocytoplasmic large DNA virus (NCLDV) with a genome size (1.2 Mb) and coding capacity ( 1000 genes) comparable to that of some cellular organisms. Unlike other viruses, Mimivirus and its NCLDV relatives encode homologs of broadly conserved informational genes found in Bacteria, Archaea, and Eukaryotes, raising the possibility that they could be placed on the tree of life. A recent phylogenetic analysis of these genes showed the NCLDVs emerging as a monophyletic group branching between Eukaryotes and Archaea. These trees were interpreted as evidence for an independent “fourth domain” of life that may have contributed DNA processing genes to the ancestral eukaryote. However, the analysis of ancient evolutionary events is challenging, and tree reconstruction is susceptible to bias resulting from non-phylogenetic signals in the data. These include compositional heterogeneity and homoplasy, which can lead to the spurious grouping of compositionally-similar or fast-evolving sequences. Here, we show that these informational gene alignments contain both significant compositional heterogeneity and homoplasy, which were not adequately modelled in the original analysis. When we use more realistic evolutionary models that better fit the data, the resulting trees are unable to reject a simple null hypothesis in which these informational genes, like many other NCLDV genes, were acquired by horizontal transfer from eukaryotic hosts. Our results suggest that a fourth domain is not required to explain the available sequence data

    Multiple phase transitions in a system of exclusion processes with limited reservoirs of particles and fuel carriers

    Full text link
    The TASEP is a paradigmatic model from non-equilibrium statistical physics, which describes particles hopping along a lattice of discrete sites. The TASEP is applicable to a broad range of different transport systems, but does not consider the fact that in many such systems the availability of resources required for the transport is limited. In this paper we extend the TASEP to include the effect of a limited number of two different fundamental transport resources: the hopping particles, and the "fuel carriers", which provide the energy required to drive the system away from equilibrium. As as consequence, the system's dynamics are substantially affected: a "limited resources" regime emerges, where the current is limited by the rate of refuelling, and the usual coexistence line between low and high particle density opens into a broad region on the phase plane. Due to the combination of a limited amount of both resources, multiple phase transitions are possible when increasing the exit rate beta for a fixed entry rate alpha. This is a new feature that can only be obtained by the inclusion of both kinds of limited resources. We also show that the fluctuations in particle density in the LD and HD phases are unaffected by fluctuations in the number of loaded fuel carriers, except by the fact that when these fuel resources become limited, the particle hopping rate is severely reduced

    Analysis of the Sam50 translocase of excavate organisms supports evolution of divergent organelles from a common endosymbiotic event

    Get PDF
    As free-living organisms the ancestors of mitochondria and plastids encoded complete genomes, proteomes and metabolomes. As these symbionts became organelles all these aspects were reduced – genomes have degenerated with the host nucleus now encoding the most of the remaining endosymbiont proteome, while the metabolic processes of the symbiont have been streamlined to the functions of the emerging organelle. By contrast, the topology of the endosymbiont membrane has been preserved, necessitating the development of complex pathways for membrane insertion and translocation. In this study, we examine the characteristics of the endosymbiont-derived β-barrel insertase Sam501 in the excavate super-group. A candidate is further characterized in Trichomonas vaginalis, an unusual eukaryote possessing degenerate hydrogen-producing mitochondria called hydrogenosomes. This information supports a mitochondriate eukaryotic common ancestor with a similarly evolved β-barrel insertase, which has continued to be conserved in degenerate mitochondria

    Triggering mechanism and tsunamogenic potential of the Cape Fear Slide complex, U.S. Atlantic margin

    Get PDF
    Author Posting. Š American Geophysical Union, 2007. This article is posted here by permission of American Geophysical Union for personal use, not for redistribution. The definitive version was published in Geochemistry Geophysics Geosystems 8 (2007): Q12008, doi:10.1029/2007GC001722.Analysis of new multibeam bathymetry data and seismic Chirp data acquired over the Cape Fear Slide complex on the U.S. Atlantic margin suggests that at least 5 major submarine slides have likely occurred there within the past 30,000 years, indicating that repetitive, large-scale mass wasting and associated tsunamis may be more common in this area than previously believed. Gas hydrate deposits and associated free gas as well as salt tectonics have been implicated in previous studies as triggers for the major Cape Fear slide events. Analysis of the interaction of the gas hydrate phase boundary and the various generations of slides indicates that only the most landward slide likely intersected the phase boundary and inferred high gas pressures below it. For much of the region, we believe that displacement along a newly recognized normal fault led to upward migration of salt, oversteepening of slopes, and repeated slope failures. Using new constraints on slide morphology, we develop the first tsunami model for the Cape Fear Slide complex. Our results indicate that if the most seaward Cape Fear slide event occurred today, it could produce waves in excess of 2 m at the present-day 100 m bathymetric contour.Acquisition of new data was funded by NOAA Ocean Exploration grant NA03OAR4600100 to C.R., and we thank the National Science Foundation for contributing to transit costs for the ship

    Eruptive modes and hiatus of volcanism at West Mata seamount, NE Lau basin : 1996–2012

    Get PDF
    Author Posting. © American Geophysical Union, 2014. This article is posted here by permission of American Geophysical Union for personal use, not for redistribution. The definitive version was published in Geochemistry, Geophysics, Geosystems 15 (2014): 4093–4115, doi:10.1002/2014GC005387.We present multiple lines of evidence for years to decade-long changes in the location and character of volcanic activity at West Mata seamount in the NE Lau basin over a 16 year period, and a hiatus in summit eruptions from early 2011 to at least September 2012. Boninite lava and pyroclasts were observed erupting from its summit in 2009, and hydroacoustic data from a succession of hydrophones moored nearby show near-continuous eruptive activity from January 2009 to early 2011. Successive differencing of seven multibeam bathymetric surveys of the volcano made in the 1996–2012 period reveals a pattern of extended constructional volcanism on the summit and northwest flank punctuated by eruptions along the volcano's WSW rift zone (WSWRZ). Away from the summit, the volumetrically largest eruption during the observational period occurred between May 2010 and November 2011 at ∼2920 m depth near the base of the WSWRZ. The (nearly) equally long ENE rift zone did not experience any volcanic activity during the 1996–2012 period. The cessation of summit volcanism recorded on the moored hydrophone was accompanied or followed by the formation of a small summit crater and a landslide on the eastern flank. Water column sensors, analysis of gas samples in the overlying hydrothermal plume and dives with a remotely operated vehicle in September 2012 confirmed that the summit eruption had ceased. Based on the historical eruption rates calculated using the bathymetric differencing technique, the volcano could be as young as several thousand years.Support for R.W.E. during this study was by internal NOAA funding to the NOAA Vents Program (now Earth-Ocean Interactions Program). The NSF Ridge 2000 and MARGINS programs played a major role in the planning and justification for the 2009 rapid response proposal that funded the May 2009 expedition. MBARI provided support and outstanding postprocessing of the multibeam bathymetry from the D. Allan B. AUV multibeam sonar used in this study. NSF also provided major funding for the 2009 expedition (OCE930025 and OCE-0934660 to JAR) and for the 210Po-210Pb radiometric dating (OCE-0929881 and for the 210Po-210Pb radiometric dating (OCE-0929881 to KHR)). The NOAA Office of Exploration and Research provided major funding for the 2009 and 2012 field programs.2015-04-3
    • …
    corecore